COMPARATIVE  ASSESSMENT  OF  SIX  CASE-MIX 
CLASSIFICATION  SYSTEMS  WHF ^  APPLIED  TO 
SIMULATED  DoD  POPULATIONS 


Contract  Number:  MDA903-88-C-0071 
Task  Order  No.  8-90/91 


Task  Order  Proponent:  Lieutenant  Colonel  Stuart  W.  Baker.  M.S. 


Submitted  To: 

U  S.  Department  Of  Defense 
Office  of  the  Assistant  Secretary  of  Defense 
Health  Affairs 

Three  Skyline  Place,  Room  1507 
5201  Leesburg  Pike 
Falls  Church,  VA  22041-3201 


This  document  was  prepared  by  Paul  Mermelstein,  Project  Director;  David  Bodycombe,  Project  Manager; 
Scott  Armstrong,  Lead  Project  Analyst;  Jim  Love,  Project  Analyst;  and  Anthony  Avens,  Project  Analyst. 


(This  document  has  been  approved 
for  public  release  and  sale;  its 
distribution  is  unlimited. 


Birch&Dtavis  Associates.  Inc. 

8905  FAIRVIEW  ROAD 
SUITE  300 

SILVER  SPRING.  MARYLAND  20910 
(301)  589-6760 


December  30,  1991 

92  6  2f>  o  of> 


92-17026 


REPORT  DOCUMENTATION  PAGE 


Form  Approved 
OM3  No.  0704-0188 


1.  AGENCY  USE  ONLY  (Leave  blank)  2.  REPOST  DATE  3.  REPORT  TYPE  AND  DATES  COVERED 

30  December  1991 


4.  TITLE  AND  SUBTITLE 

COMPARATIVE  ASSESSMENT  OF  SIX  CASE-MIX  CLASSIFICATION 
SYSTEMS  WHEN  APPLIED  TO  SIMULATED  DOD  POPULATIONS 

5.  FUNDING  NUMBERS 

MDA903-88-C-007 1 

TA-8-90/91 

6.  AUTHOR(S) 

Birch&Davis  Associates,  Inc. 

Task  Order  Proponent:  Stuart  W.  Baker,  LTC,  MS,  USA 

7.  PERFORMING  ORGANIZATION  NAM£(S)  AND  ADDRESS(ES) 

Birch&Davis  Associates,  Inc. 

8905  Fairview  Road 

Suite  300 

Silver  Spring,  MD  20910 

i 

i 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

MDA903-88-C-007 1 

9.  SPONSORING /MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

OASD/HA/HSO/RAMS 

3  Skyline  Place,  Suite  1507 

5201  Leesburg  Pike 

Falls  Church,  VA  22041-3203 

10.  SPONSORING /MONITORING 

AGENCY  REPOST  NUMBER 

11.  SUPPLEMENTARY  NOTES  j 

1 

i 

_ J 

12a.  DISTRIBUTION  /AVAILABILITY  STATEMENT 

Approved  for  Public  Release:  Distribution  is  Unlimited 

! 

12b.  DISTRIBUTION  CODE 

13.  ABSTRACT  (Maximum  2C0  words) 

This  report  presents  the  results  of  a  comparative  analysis  of  six  case-mix  classifi¬ 
cation  systems  when  applied  to  a  simulation  of  DoD  patient  populations.  This 
represents  the  culmination  of  a  series  of  reports  that  assessed  each  of  the  case-mix 
classification  schemes.  Case-mix  classification  schemes  tested  were  Ambulatory  Work 
Units  (AWUs) ;  Civilian  Health  and  Medical  Program  of  the  Uniformed  Services  (CHAMPUS) 
visit  charges;  Ambulatory  Visit  Groups  (AVGs)  based  on  CHAMPUS  charges;  AVGs  based 
on  the  Resource-Based  Relative  Value  Scale  (RBRVS) ;  Ambulatory  Patient  Groups  (APGs) 
based  on  CHAMPUS  charges;  and  APGs  based  on  RBRVS.  Simulations  were  used  to 
facilitate  a  comparison  of  these  competing  classification  schemes. 


la.  SUBJECT  terms  Case-Mix;  Ambulatory  Case-Mix;  Ambulatory  Care 
Classification;  Case-Mix  Classification;  Ambulatory  Patient 
Groups  (APGs);  Resource-Based  Relative  Value  Scale  (RBRVS); 
CHAMPUS;  Ambulatory  Work  Units  (AWUs). 


15.  NUMBER  OF  PAGES 
120 


16.  PRICE  CODE 


117.  SECURITY  CLASSIFICATION  118.  SECURITY  CLASSIFICATION  19.  SECURITY  CLASSIFICATION  20.  LIMITATION  OF  ABSTRACT 

1  OF  REPORT  OF  THIS  PAGE  OF  ABSTRACT 


j  Unclassified 


MSN  7540-0 ' -230-3500 


Unclassified 


Unclassified 


$tanc3'd  Perm  298  f^ev  2-89) 

*’rF*cj  t.  *  A’jSi 


CONTENTS 


EXHIBITS 


Number 
III- 1 

III-2 

III-3 

III-4 

III-5 

III-6 

III- 7 

IV-  1 


RESOURCE  INTENSITY  BY  METHOD- -FACILITY 
SIMULATIONS 

DIFFERENCE  BETWEEN  AWUs  AND  CHAMPUS 
DOLLAR-BASED  GROUPERS- -FACILITY 
SIMULATIONS 

DIFFERENCE  BETWEEN  AWUs  AND  RBRVS-BASED 
GROUPERS- -FACILITY  SIMULATIONS 

RESOURCE  INTENSITY  BY  METHOD- -SPECIAL 
SCENARIO  SIMULATIONS 

DIFFERENCE  BETWEEN  AWUs  AND  CHAMPUS 
DOLLAR-BASED  GROUPERS- -SPECIAL  SCENARIO 
SIMULATIONS 

DIFFERENCE  BETWEEN  AWUs  AND  RBRVS-BASED 
GROUPERS- -SPECIAL  SCENARIO  SIMULATIONS 

DIFFERENCE  BETWEEN  AWUs  AND  GROUPERS-- 
SIMULATION  12  (COMPLEX  CASE-MIX  VISITS) 

SUGGESTED  CRITERIA  FOR  EVALUATING  CASE-MIX 
CLASSIFICATION  METHODS 


Page 

III-2 

III-2 

III-2 

III-3 

III-3 

III-3 

III- 3 

IV-  1 


Birch  &  Davis  Associates.  Inc. 


Page  ii 


EXECUTIVE  SUMMARY 


The  following  report  presents  the  results  of  a  comparative  analysis  of  six  case-mix  classification 
systems  when  applied  to  a  simulation  of  DoD  patient  populations.  This  represents  the 
culmination  of  a  series  of  reports  that  assessed  each  of  the  case-mix  classification  schemes. 

This  executive  summary,  which  parallels  the  structure  of  the  main  report,  includes  the  following 
sections: 

•  Introduction 

•  Methodology 

•  Results 

•  Conclusions  and  Recommendations 

1.  INTRODUCTION 

The  purpose  of  this  effort  was  to  assess  which  of  six  case-mix  classification  methodologies  was 
most  effective  as  a  tool  for  allocating  ambulatory  care  resources  within  the  DoD.  The  following 
case-mix  classification  schemes  were  tested  using  simulated  Medical  Treatment  Facility  (MTF)- 
level  data: 

•  Ambulatory  Work  Units  (AWUs) 

•  Civilian  Health  and  Medical  Program  of  the  Uniformed  Services  (CHAMPUS) 
visit  charges 

•  Ambulatory  Visit  Groups  (AVGs)  based  on  CHAMPUS  charges 

•  AVGs  based  on  the  Resource-Based  Relative  Value  Scale  (RBRVS) 

•  Ambulatory  Patient  Groups  (APGs)  based  on  CHAMPUS  charges 

•  APGs  based  on  the  RBRVS 

In  order  to  facilitate  a  comparative  assessment  of  these  competing  classification  schemes,  we 
created  12  "simulations."  Of  these  12  simulations,  six  represent  the  ambulatory  patient  case-mix 
of  MTFs  derived  from  the  demographic  profiles  of  actual  facilities  and  morbidity  data  from 
three  study  databases.  An  additional  six  simulations  were  created  to  test  the  sensitivity  of  the 
various  case-mix  classification  schemes  to  patient  population  extremes. 

The  objectives  of  this  present  effort  were  to  (I)  simulate  the  health  care  utilization  of  actual  as 
well  as  demographically  unique  MHSS  service  populations,  (2)  conduct  a  comparative  assessment 
of  alternative  case-mix  classification  methodologies  in  describing  ambulatory  care  resource  use 
within  simulated  service  populations,  and  (3)  use  the  RBRVS  as  a  substitute  for  CHAMPUS 
dollars  in  computing  resource  use. 
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2. 


METHODOLOGY 


In  order  to  facilitate  the  comparison  of  the  case- mix  classification  schemes,  we  created  12 
simulation  databases  containing  2,000  patients  and  a  year’s  worth  of  their  encounter  data.  The 
first  six  simulations  were  based  on  actual  MTF  demographics,  while  the  second  six  represented 
special  scenarios. 

2.1  Creation  Of  Simulation  Patient  Records 

In  an  effort  to  provide  diversity,  the  decision  as  to  which  MTF  facilities  to  simulate  was  made 
based  on  the  following: 

•  MTF  demographics  taken  from  Deers/RAPS 

•  The  need  for  a  mix  of  Army,  Navy,  and  Air  Force  facilities 

•  The  need  for  a  mix  of  Medical  Centers  and  Community  Hospitals 

In  order  to  highlight  the  reactions  of  the  classification  schemes  to  specific  populations,  the 
following  six  special  scenario  simulations  were  developed: 

•  Children  (Age<15) 

•  Adults  (Age  45-64) 

•  Active  Duty 

•  Complex  Case-Mix  Visits 

•  Women  of  Childbearing  Age  (Age  18-44) 

•  Over  65  Years  of  Age 

To  create  the  simulation  databases,  we  took  the  following  steps: 

•  We  created  a  patient  selection  list  based  upon  the  unduplicated  patients  found  in 
our  three  study  databases  ( 1 )  the  Uniformed  Services  Treatment  Facilities  (USTF) 
database,  (2)  the  CHAMPUS  database,  and  (3)  the  Ambulatory  Care  Database 
(ACDB). 

•  We  randomly  selected  2,000  patients  from  the  simulation  patient  selection  list  for 
each  of  the  scenarios  to  be  simulated  based  upon  stratum  (age  and  sex)  and 
beneficiary  status  (active  duty  or  other),  except  for  the  complex  case-mix 
simulation,  which  was  based  on  visits  with  high  relative  values. 

•  For  the  selected  patients,  we  retrieved  all  of  their  encounters  from  their 
respective  databases. 

•  Based  upon  each  record’s  source,  we  attached  a  weighting  variable  to  create  an 
approximation  of  one  year  of  data. 

•  We  then  used  an  algorithm  to  add  procedures  to  visits  that  did  not  have  them. 
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There  was  one  deficiency  that  could  not  be  effectively  addressed  in  this  five-step  process.  The 
databases  (i.e.,  ACDB,  CHAMPUS,  and  USTF)  failed  to  adequately  reflect  pregnancy  visits 
because  these  visits  were  often  bundled  with  data  for  the  actual  delivery.  Therefore,  we  took 
the  following  extra  steps  in  creating  the  simulation  for  women  of  childbearing  age: 

•  Patient  data  were  randomly  adjusted  to  reflect  age-specific  fertility  rates 
provided  by  the  National  Center  for  Health  Statistics. 

•  Standard  initial  and  follow-up  prenatal  care  visits  were  constructed  based  upon 
guidelines  of  the  American  College  of  Obstetrics/Gynecology,  the  American 
Public  Health  Association,  and  the  opinion  of  a  medical  records  expert.  (See 
Appendix  B  for  the  specifics  on  these  visits.) 

•  The  number  of  visits  assigned  to  each  "pregnant"  woman  was  then  decided 
probabilistically  for  a  one-year  study  period.  In  other  words,  each  pregnancy 
could  have  occurred  fully  or  partially  during  the  one  year  of  inquiry  and,  thus, 
the  number  of  prenatal  care  visits  was  varied  to  reflect  the  interval  within  the 
study  period. 

2.2  Development  And  Attachment  Of  Resource  Measures 

After  the  patient  encounter  information  had  been  compiled  for  each  simulation,  steps  had  to  be 
taken  to  attach  the  different  case-mix  classification  weights  to  each  visit,  so  that  an  analysis  of 
the  effects  of  the  various  simulations  could  be  performed.  These  steps  were: 

•  Develop  AVG  and  APG  relative  values  based  on  Hsiao’s  RBRVS 

•  Group  all  visits  and  attach  each  of  the  resource  measures--mean  CHAMPUS 
billed  charge,  CHAMPUS  dollar-based  AVG  and  APG  relative  values,  RBRVS- 
based  AVG  and  APG  relative  values,  and  AWU  weights 

2.3  Analytical  Approach 

There  were  two  steps  taken  in  analyzing  the  simulation  data.  First,  for  each  simulation,  the  top 
10  diagnoses,  procedures,  AVGs,  APGs,  and  Uniform  Chart  of  Account  (UCA)  codes  were 
compiled  along  with  a  graphical  representation  of  the  demographics.  (This  information  is 
provided  in  Appendix  D.)  These  characteristics  were  then  analyzed  to  assess  the  face  validity 
of  each  simulation.  Second,  standardized  weights  for  each  case-mix  classification  scheme  were 
calculated  for  each  simulation,  so  that  the  results  could  be  compared  across  simulations  as  well 
as  allocation  schemes. 

2.4  Limitations  Of  The  Simulation 

The  limitations  of  this  simulation,  for  the  most  part,  stem  back  to  the  three  source  databases 
(ACDB,  CHAMPUS,  and  USTF)  and  have  been  documented  in  past  reports.  Since  we  were  able 
to  overcome  almost  all  of  the  data  limitations  that  we  encountered,  the  results  of  the  simulation 
showed  excellent  face  validity  and  should  prove  to  be  useful  to  the  DoD  in  choosing  a  case-mix 
classification  system. 
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3. 


RESULTS 


3.1  Characteristics  Of  Each  Simulation 

Before  we  could  undertake  any  analyses  regarding  the  case-mix  classification  schemes,  a 
thorough  assessment  of  the  validity  of  each  simulation  was  performed  in  terms  of  selected 
population  characteristics. 

3.1.1  Facility-Specific  Simulations 

All  of  the  facility-specific  simulations  are  numbered  in  order  to  ease  the  presentation  of  results, 
as  follows: 


•  Simulation  1 --Community  Hospital 

•  Simulation  2--Medical  Center 

•  Simulation  3--MedicaI  Center 

•  Simulation  4--Community  Hospital 

•  Simulation  5--Community  Hospital 

•  Simulation  6--Community  Hospital 

The  demographics  of  these  simulations  were  tailored  to  reflect  the  service  populations  of  actual 
MTFs  and,  therefore,  show  good  face  validity.  Also,  since  the  populations  are  not  radically 
different,  it  would  be  expected  that  the  top  10  diagnoses  for  each  of  the  facility-specific 
simulations  should  be  very  similar.  This  was  indeed  the  case. 

3.1.2  Special  Scenario  Simulations 

The  simulation  numbers  and  the  special  scenario  that  they  represent  are  as  follows: 

•  Simulation  7--Children 

•  Simulation  8--Active  Duty 

•  Simulation  9--Women  of  Childbearing  Age 

•  Simulation  10--Adults 

•  Simulation  ll--Over  65  Years  of  Age 

•  Simulation  12--Complex  Case-Mix 

These  simulations  yielded  case-mix  patterns  that  were  commensurate  with  their  age  distributions. 

3.2  Results  Of  Analyses  Of  The  Case-Mix  Classification  Schemes 

Analyses  of  the  results  of  applying  the  six  case-mix  classification  schemes  to  each  of  the 
simulations  provided  some  insights  into  the  characteristics  of  each  allocation  strategy. 
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3.2.1 


Facility-Specific  Simulations 


The  results  of  our  analysis  of  the  case-mix  classification  schemes  as  applied  to  the  facility- 
specific  simulations  are  shown  in  Exhibits  III- 1 ,  III -2,  and  III-3.  The  most  prominent  results 
of  this  analysis  were  the  following: 

•  There  was  a  relatively  high  degree  of  homogeneity  between  allocation 
methodologies  in  terms  of  resource  intensity.  However,  the  effect  of  choosing  one 
alternative  over  another  could  change  the  level  of  a  resource  budget  by  plus  or 
minus  8  percent. 

•  AWUs  were  the  least  sensitive  to  facility-specific  differences  in  case-mix. 

•  Measures  based  on  CHAMPUS-billed  charges  are  most  sensitive  to  facility- 
specific  differences  in  case-mix,  as  one  should  expect,  because  of  the  overall 
greater  dispersion  of  raw  CHAMPUS  dollars. 

•  RBRVS-based  measures  show  a  constrained  variation  more  comparable  to  AWUs, 
because  the  raw  RBRVS  weights  do  not  vary  as  much  as  raw  CHAMPUS  dollars. 

3.2.2  Special  Scenario  Simulations 

The  results  of  applying  the  case-mix  classification  schemes  to  the  special  scenario  simulations 
are  shown  in  Exhibits  III-4,  III-5,  III-6,  and  III- 7.  The  most  prominent  results  of  this  analysis 
were  the  following: 

•  As  with  the  facility-specific  simulations,  the  following  results  were  once  again 
evident. 

AWUs  were  the  least  sensitive  to  facility-specific  differences  in  case-mix. 

Measures  based  on  CHAMPUS-billed  charges  are  most  sensitive  to 
differences  in  case-mix,  as  one  should  expect. 

RBRVS-based  measures  show  a  constrained  variation  more  comparable  to 
AWUs. 

•  There  was  high  face  validity  in  terms  of  expected  behavior  of  the  case-mix 
methods. 

•  Excluding  the  complex  case-mix  simulation,  the  Over  65  Years  of  Age  simulation 
is  highest  in  resource  intensity  regardless  of  the  case-mix  system  employed. 

•  Alternatively,  the  simulation  of  Children  had  the  lowest  indices  of  resource 
intensity. 
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•  Neither  APGs  nor  AVGs  offered  any  resourcing  advantages  regarding  Active 
Duty  Personnel.  This  may  be  because  AWU  weights  are  based  upon  all  population 
segments  in  each  clinic.  Since  active  duty  personnel  generally  represent  the 
healthier  segment  of  a  population,  their  visits  usually  require  fewer  resources  than 
average  in  each  clinic. 

•  Only  RBR  VS-based  APGs  were  not  advantageous  to  Women  of  Childbearing  Age. 

•  AWUs  were  by  far  the  least  sensitive  to  the  complex  case-mix  simulation. 

•  The  simulation  of  complex  case-mix  visits  yielded  the  highest  resource  values  of 
all  the  simulations,  regardless  of  the  case-mix  system. 

4.  CONCLUSIONS  AND  RECOMMENDATIONS 

The  results  of  our  analysis  do  not  conclusively  point  to  any  of  the  case-mix  classification 
methodologies  as  being  clearly  superior.  Each  of  the  systems  has  its  merits  and  its  drawbacks, 
although  none  of  the  systems  has  a  weakness  that  is  serious  enough  to  completely  exclude  it  from 
consideration. 

For  measuring  resources,  we  feel  that  the  RBRVS  is  better  than  CHAMPUS  dollars  because  it: 

•  Represents  a  more  comprehensive  approach  to  costing,  that  considers  three 
dimensions  of  physicians’  charges:  (1)  physician  time,  (2)  technical  overhead 
(office  staff,  rent,  and  supplies),  and  (3)  malpractice  insurance. 

•  Eliminates  distortions  in  current  RVs. 

•  Mitigates  differences  by  Specialty. 

•  Encourages  "cognitive"  medicine. 

Given  that  none  of  the  specific  systems  has  deficiencies  that  would  make  them  unusable,  the 
selection  of  one  method  should  reflect  the  DoD’s  desired  incentives.  Based  on  our  analysis,  we 
feel  that  APGs  is  the  best  case-mix  classification  system  for  use  by  the  DoD.  We  based  this 
decision  on  the  following  factors: 

•  Since  HCFA  will  be  adopting  APGs,  we  feel  that  the  DoD  should  also  adopt  them 
in  an  effort  to  maintain  consistency.  CHAMPUS  providers  will  be  using  APGs 
with  their  Medicare  patients  as  a  result  of  the  HCFA  move  and  will  not  want  to 
have  to  use  two  different  systems.  Also,  if  the  DoD  wants  to  be  able  to  compare 
their  data  with  the  rest  of  the  country  in  future  studies,  APGs  will  allow  for  this 
comparability. 

•  Based  on  the  full  list  of  criteria  in  the  matrix  (Exhibit  IV- 1),  APGs  come  out  on 
top  of  both  AWUs  and  AVGs. 
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•  APGs  are  the  next  generation  of  AVGs  and  are  more  flexible  due  to  procedure 
bundling. 

•  AVGs  are  no  longer  under  development,  whereas  APGs  continue  to  be  studied  and 
refined. 
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CHAPTER  I 


INTRODUCTION 


The  purpose  of  this  effort  was  to  assess  which  of  six  case-mix  classification  methodologies  was 
most  effective  as  a  too!  for  allocating  ambulatory  care  resources  within  the  DoD.  The  following 
case-mix  classification  schemes  were  tested  using  simulated  MTF-level  data: 

•  Ambulatory  Work  Units  (AWUs) 

•  CHAMPUS  visit  charges 

•  Ambulatory  Visit  Groups  (AVGs)  based  on  CHAMPUS  charges 

•  AVGs  based  on  the  Resource-Based  Relative  Value  Scale  (RBRVS) 

•  Ambulatory  Patient  Groups  (APGs)  based  on  CHAMPUS  charges 

•  APGs  based  on  the  RBRVS 

In  order  to  facilitate  a  comparative  assessment  of  these  competing  classification  schemes,  we 
created  12  case-mix  "simulations."  Of  these  12  simulations,  six  represent  the  ambulatory  patient 
case-mix  of  Medical  Treatment  Facilities  derived  from  the  demographic  profiles  of  actual 
facilities  and  morbidity  data  from  three  study  databases.  An  additional  six  simulations  were 
created  to  test  the  sensitivity  of  the  various  case-mix  classification  schemes  to  patient  population 
extremes.  These  simulations  were  as  follows: 


Children  (age  <  15  years) 

Adults  (age  45-64) 

Active  duty 

Complex  case-mix  visits 

Women  of  childbearing  age  (age  18-44  years) 

Over  65  years  of  age 


The  sensitivity  of  each  of  the  alternative  case-mix  classification  schemes  to  each  of  the  12 
simulations  was  determined. 


In  much  of  our  previous  work,1  we  have  focused  individually  on  many  of  the  prevailing  case- 
mix  classification  schemes.  The  alternatives  that  we  assessed  were  APGs,  AVGs,  and  Episodes 
of  Illness.  In  assessing  Episodes  of  Illness,  we  decided  that  the  inability  to  capture  all 
ambulatory  visits  for  any  given  illness  episode  made  it  impossible  to  develop  any  useful  resource 
allocation  scheme  at  this  time.  Therefore,  Episodes  of  Illness  were  not  included  in  our  current 
effort.  However,  as  part  of  the  APG  and  AVG  assessments,  we  were  able  to  value  resources  in 
terms  of  relative  values  for  our  three  DoD  databases2  using  CHAMPUS  dollars. 


^e  Appendix  A,  a  listing  of  our  prior  relevant  work. 

2These  databases  were  (1)  the  Uniformed  Services  Treatment  Facilities  (USTF)  database,  (2)  the 
Civilian  Health  and  Medical  Program  of  the  Uniformed  Services  (CHAMPUS)  database,  and  (3)  the 
Ambulatory  Care  Database  (ACDB). 
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Since  each  of  our  prior  reports  dealt  with  only  one  classification  scheme  at  a  time,  we  decided 
that  a  side-by-side  comparison  of  these  methodologies  versus  the  current  DoD  case-mix 
classification  system,  AWUs,  would  provide  valuable  information.  We  also  chose  to  use  both 
CHAMPUS  dollars  and  the  recently  promulgated  RBRVS  values  as  the  resource  outputs  for  each 
visit. 

In  summation,  the  present  effort  has  the  following  three  objectives: 

•  Simulate  the  health  care  utilization  of  actual  as  well  as  demographically  unique 
MHSS  service  populations. 

•  Conduct  a  comparative  assessment  of  alternative  case-mix  classification 
methodologies  in  describing  ambulatory  care  resource  use  within  simulated  service 
populations. 

•  Use  the  RBRVS  as  a  substitute  for  CHAMPUS  dollars  in  computing  resource  use. 

This  report  presents  the  methodology  used  to  perform  these  comparisons  along  with  the 
associated  results. 
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CHAPTER  II 
METHODOLOGY 


In  this  chapter,  we  present  the  steps  taken  in  formulating  the  12  simulations  used  in  our  analysis 
of  the  competing  case-mix  classification  schemes.  This  chapter  is  segmented  into  the  following 
sections: 

•  Creation  of  the  simulations 

•  Development  and  attachment  of  resource  measures 

•  Analytical  approach 

•  Limitations  of  the  simulation 


1.  CREATION  OF  THE  SIMULATIONS 


A  total  of  12  ambulatory  care  case- mix  simulations,  each  representing  the  morbidity  experience 
of  2,000  patients  over  a  one-year  period,  were  created.  Of  the  12  simulations,  six  were  modeled 
on  the  demographic  characteristics  of  actual  Medical  Treatment  Facilities.  The  age  and  sex 
distribution  of  these  MTF  populations  were  applied  to  an  unduplicated  file  of  patients  derived 
from  each  of  the  three  study  databases.  Simulation  patients  were  randomly  selected  and  their 
health  care  encounters  over  a  one- year  period  were  used  to  actually  simulate  case-mix.  In  an 
effort  to  provide  diversity  among  these  six  MTF  simulations,  we  decided  which  facilities  to 
simulate  based  upon  the  following: 


•  MTF  demographics  taken  from  Deers/RAPS 

•  The  need  for  a  representation  of  Army,  Navy,  and  Air  Force  facilities 

•  The  need  for  a  mix  of  Medical  Centers  and  Community  Hospitals 


In  order  to  highlight  the  reactions  of  the  classification  schemes  to  specific  populations,  the 
following  six  special  simulations  were  also  developed: 


•  Children  (aged  <  15  years) 

•  Adults  (aged  45-64  years) 

•  Active  duty  (These  patients  were  selected  from  the  ACDB  and  USTF  only,  since 
CHAMPUS  does  not  treat  active-duty  personnel.) 

•  Complex  case-mix  visits  (This  simulation  contains  all  visits  from  the  51  AVGs 
with  the  highest  relative  values  developed  in  the  CHAMPUS  database  using 
dollars  as  the  resource  measure.) 

•  Women  of  childbearing  age  (aged  18-44  years) 

•  Adults  over  65  years  of  age 
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The  following  narrative  describes  the  discrete  steps  that  were  required  to  prepare  the  12 
simulations. 

•  Step  1--We  created  a  patient  list  based  upon  the  unduplicated  patients  found  in 
the  USTF,  CHAMPUS,  and  ACDB  databases.  A  new  data  file  was  created  for 
these  patients  that  included,  in  addition  to  pertinent  demographic  and  health 
status  characteristics  drawn  from  the  parent  files,  variables  that: 

Designated  a  patient  age  and  sex  stratum  to  be  used  for  subsequent  random 
selection  to  reflect  desired  simulation  demographic  characteristics 

Classified  patients  as  active  duty  or  other 

•  Step  2--We  randomly  selected  2,000  patients  from  the  simulation  patient  selection 
list  for  each  of  the  scenarios  to  be  simulated  based  upon  their  age  and  sex  stratum 
and  beneficiary  status  (active  duty  or  other).  The  one  exception  to  this  was  the 
complex  case-mix  simulation,  which  was  based  on  visits  with  high  relative  values 
rather  than  demographics. 

•  Step  3--We  retrieved  all  of  the  ambulatory  care  encounters  for  the  selected 
patients  from  their  source  databases. 

•  Step  4--Since  only  one  of  our  databases  provided  encounter  data  for  exactly  one 
year,  we  employed  a  weighting  variable  to  create  an  approximation  of  one  year 
of  data.  The  weight  to  be  attached  to  visits  in  each  database  was  as  follows: 

USTF  x  1.00 

CHAMPUS  x  4.00 

ACDB  x  2  x  0.75  =  ACDB  x  1.50  (since  it  is  a  50  percent  sample  of  16 
months  of  data) 

•  Step  5- -Many  visits  in  the  USTF  and  ACDB  databases  contained  no  procedure 
information  whatsoever.  In  these  cases,  we  attempted  to  attach  procedures  based 
upon  clinical  norms  that  were  reflected  in  the  pooled  encounter  data  for  all  12 
simulations.  In  establishing  norms,  we  looked  at  sets  or  "packages”  of  procedures 
that  occurred  commonly.  A  procedure  "package"  was  chosen  at  random  from  the 
top  six  "packages"  for  that  age-sex  stratum  and  diagnosis.  Random  selection  was 
in  proportion  to  the  frequency  that  the  "package"  was  observed  to  occur.  Only  12 
percent  of  the  visits  actually  needed  to  have  procedures  added  using  this 
algorithm.  By  using  the  algorithm,  87  percent  of  these  visits  were  assigned 
procedures.  The  other  13  percent  still  do  not  contain  any  procedural  data  because 
they  either  had  no  recorded  diagnosis  or  they  had  a  unique  diagnosis. 
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There  was  one  deficiency  that  could  not  be  effectively  addressed  in  this  five-step  process.  The 
databases  (i.e.,  ACDB,  CHAMPUS,  and  USTF)  failed  to  adequately  reflect  pregnancy  visits 
because  these  visits  were  often  bundled  with  data  for  the  actual  delivery.  Therefore,  a 
methodology  was  created  for  adapting  the  simulation  for  women  of  childbearing  age  that  would 
reflect  these  visits.  After  randomly  selecting  the  2,000  patients  and  retrieving  all  of  their 
encounter  records,  the  following  extra  steps  were  undertaken: 

•  Step  1-- Patient  data  were  randomly  adjusted  to  reflect  age-specific  fertility  rates 
provided  by  the  National  Center  for  Health  Statistics. 

•  Step  2--Standard  initial  and  follow-up  prenatal  care  visits  were  constructed  based 
upon  guidelines  of  the  American  College  of  Obstetrics/Gynecology  and  the 
American  Public  Health  Association.  (See  Appendix  B  for  the  specifics  on  these 
visits.) 

•  Step  3--Pregnancies  by  months  to  full-term  delivery  were  uniformly  distributed 
across  all  12  months  and  encounters  assigned  based  upon  the  expected  visits  for 
each  trimester. 

2.  DEVELOPMENT  AND  ATTACHMENT  OF  RESOURCE  MEASURES 

After  the  patient  encounter  information  had  been  compiled  for  each  simulation,  we  developed 
and  attached  a  resource  weight  for  each  case-mix  classification  approach  to  all  visits  by 
executing  the  following  steps: 

•  Step  1--We  developed  relative  values  based  on  Hsiao’s  RBRVS  by  performing  the 
following  substeps: 

We  attached  Hsiao’s  RBRVS  to  each  procedure  in  the  CHAMPUS  database. 
The  RBRVS  weight  was  mapped  for  a  particular  CPT-4  procedure  code  to 
the  same  CPT-4  code  in  the  CHAMPUS  database. 

Using  the  RBRVS  as  the  resource  measure,  we  developed  AVG  and  APG 
weights.  The  method  employed  was  very  similar  to  that  used  in  the 
creation  of  weights  based  on  CHAMPUS  dollars.1  The  one  difference  was 
that  no  trimming  of  the  data  was  needed,  because  the  RBRVS  do  not 
contain  any  outliers. 

•  Step  2--We  then  grouped  all  of  the  visits  and  attached  each  of  the  resource 
measures  to  be  studied  by  performing  the  following  substeps: 

We  ran  the  data  through  the  ambulatory  care  case-mix  method  grouping 
software,  so  that  APGs  and  AVGs  could  be  assigned  to  all  of  the  visits  in 
the  simulations. 


'See  the  B&D  reports  entitled  Assessment  of  the  Utility  of  Ambulatory  Visit  Croups  (AVGs)  as  a 
Tool  for  Ambulatory  Resource  Allocation  Within  the  United  States  Military  Health  Care  System  and 
Assessment  of  the  Utility  of  Ambulatory  Patient  Croups  ( APGs)  as  a  Tool  for  Ambulatory  Resource 
Allocation  Within  the  United  States  Military  Health  Care  System. 
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We  attached  the  pertinent  mean  CHAMPUS  billed  charge  to  each 
procedure  code  in  the  USTF  and  ACDB. 

We  attached  CHAMPUS  dollar-based  AVG  and  APG  relative  values  to 
each  visit.  We  used  the  weights  that  we  had  developed  previously  using 
the  CHAMPUS  database. 

We  attached  RBRVS-based  AVG  and  APG  relative  values  to  each  visit. 
We  used  the  weights  that  were  developed  in  Step  1. 

We  calculated  the  total  APG  relative  weight  for  each  visit  based  upon  the 
discounting  methods  that  were  employed  by  the  Health  Care  Financing 
Administration.2  (See  Appendix  C  for  the  details  of  our  implementation 
of  these  methods.) 

We  attached  AWU  weights  that  corresponded  to  the  Uniform  Chart  of 
Accounts  (UCA)  clinic  code  of  each  visit  in  the  ACDB. 

Since  the  USTF  and  CHAMPUS  did  not  contain  UCA  codes,  we  performed 
the  following  four-step  process  to  attach  these  codes  along  with  their 
corresponding  AWU  weights: 

Substep  l--From  the  ACDB,  we  established  the  six  most 
f requently  occurring  UCA  codes  for  each  AVG  and  each  associated 
age-sex  stratum.  For  those  visits  in  which  UCA  codes  were 
missing  but  an  AVG  and  age-sex  stratum  were  available,  we 
randomly  assigned  a  UCA  from  among  the  top  six  in  proportion  to 
its  frequency  of  occurrence.  It  was  not  possible  to  assign  a  UCA 
approximately  2  percent  of  the  time  because  some  of  the  AVG  and 
age-sex  stratum  combinations  that  appeared  in  the  USTF  and 
CHAMPUS  did  not  occur  in  the  ACDB. 

Substep  2 — If  any  visit  with  CHAMPUS  as  its  source  still  needed 
a  UCA  code,  we  searched  all  CHAMPUS  visits  that  already  had 
UCA  codes  for  the  six  that  appeared  the  most  frequently  with  the 
given  provider  specialty  and  stratum  combination.  Then,  we 
randomly  selected  one  of  these  six  UCA  codes  and  attached  it  to 
the  visit.  However,  this  step  was  also  unable  to  assign  UCA  codes 
to  all  of  the  visits  from  CHAMPUS,  because  some  of  the  provider 
specialty  and  stratum  combinations  that  appeared  in  the  visits  that 
still  needed  UCA  codes  did  not  occur  in  the  visits  that  already  had 
UCA  codes. 

Substep  3--For  any  visits  that  continued  to  need  UCA  codes,  the 
search  scope  was  expanded  to  include  all  visits  in  all  databases  with 
UCA  codes.  While  this  approach  resulted  in  the  identification  of 


2Design  and  Evaluation  of  a  Prospective  Payment  System  for  Ambulatory  Care ,  Averill,  Richard  F„ 
et  al.,  December  31,  1990,  pp.  55-57. 
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several  new  AVG/age-sex  stratum  combinations,  there  was  still  a 
residual  number  of  visits  that  could  not  be  assigned  UCA  codes 
due  to  their  unique  AVG/stratum  category. 

Substep  4--We  repeated  Substep  3,  except  we  broadened  the  search 
to  be  based  solely  on  AVG  and  not  on  AVG  and  stratum.  By  doing 
this,  we  were  able  to  assign  a  UCA  code  to  any  visit  that  still 
needed  one,  unless  it  had  an  AVG  that  never  occurred  in  any  of 
the  visits  with  UCA  codes.  However,  this  was  very  rare. 

•  Step  3--Some  of  the  UCA  codes  never  had  AWU  weights  developed  for  them.  To 
rectify  this  problem,  we  developed  proxy  AWU  weights  for  these  unweighted 
UCA  codes  wherever  possible  by  executing  the  following  steps: 

We  performed  a  correlation  analysis  of  AWUs  with  other  resource 
measures  to  find  the  measure  with  the  best  correlation  with  AWUs  for 
existing  weighted  UCA  codes.  The  result  of  this  analysis  showed  that 
CHAMPUS  dollars  correlated  with  AWUs  better  than  any  of  the  other 
resource  measures. 

Where  a  sufficient  number  of  observations  existed,  a  simple  linear 
regression  was  used  to  estimate  an  AWU  that  most  closely  corresponded  to 
the  total  mean  CHAMPUS  dollar  value  for  those  UCA  codes  that  were 
without  AWU  weights. 

•  Step  4--We  attached  proxy  resource  measures  for  each  case-mix  system  where 
none  were  previously  attached.  For  each  simulation,  we  developed  these  proxy 
measures  based  on  the  mean  value  of  each  case-mix  system  as  calculated  when  the 
encounters  with  missing  values  were  excluded;  e.g.,  if  simulation  X  had  5,000 
visits,  and  100  of  those  visits  had  no  AVG  weights  attached,  we  first  computed 
the  mean  AVG  weight  for  the  4,900  visits  that  had  values  and  then  applied  the 
mean  AVG  weight  to  the  100  visits  that  needed  a  relative  value.  This  was  done 
to  minimize  the  effects  of  having  visits  within  each  simulation  that  were 
effectively  shown  to  require  no  resources. 

3.  ANALYTICAL  APPROACH 

Analysis  of  the  simulation  data  involved  two  steps.  First,  for  each  simulation,  the  top  10 
diagnoses,  procedures,  AVGs,  APGs,  and  UCAs  were  identified  and  a  demographic  profile 
prepared  (see  Appendix  D).  The  internal  consistency  of  these  data  was  considered,  e.g.,  that 
older  individuals  would  show  more  chronic  conditions  in  the  top  diagnoses.  Second,  standardized 
weights  for  each  case-mix  classification  scheme  were  calculated  for  each  simulation  so  that 
comparisons  between  these  approaches  could  be  made. 

These  weights  were  calculated  in  the  following  way: 

•  Step  1--We  calculated  the  total  resource  level  in  each  simulation  for  each  of  the 
case-mix  classification  systems  (e.g.,  the  AWU  weights  for  all  of  the  visits  in 
simulation  1  were  summed  together  to  get  the  total  AWU  resource  level  in 
simulation  1). 


i 
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Step  2--We  divided  the  total  resource  levels  for  each  of  the  six  case-mix 
classification  systems  by  the  total  number  of  visits  in  each  simulation  to  get  the 
mean  resource  level  per  visit  for  each  of  the  12  simulations.  The  resource  level 
per  visit  represents  the  total  system  weight  (e.g.,  if  the  total  resource  level  for 
AWUs  in  simulation  X  was  100  and  the  number  of  visits  in  simulation  X  was 
5,000,  then  the  resource  level  per  visit  for  AWUs  equals  0.02,  or  in  other  words, 
the  total  system  weight  for  AWUs  equals  0.02). 

Step  3--For  each  case-mix  classification  system,  we  calculated  the  sum  of  the 
total  system  weights  for  the  six  facility-specific  simulations.  Then  we  divided 
this  figure  by  six  to  get  the  mean  total  system  weight  for  each  of  the  case-mix 
systems.  For  example,  if  the  following  were  true. 

Simulation  Total  System  Weight  for  AWUs 


A 

B 

C 

D 

E 

F 


0.025 

0.023 

0.027 

0.029 

0.021 

0.025 


then  the  mean  total  system  weight  for  AWUs  would  be  0.15  divided  by  6;  this 
equals  0.025. 

Step  4--For  each  of  the  six  facility-specific  simulations,  we  computed  the 
standardized  weight  for  each  of  the  case-mix  classification  systems  by  dividing 
the  total  system  weight  for  the  given  simulation  by  the  mean  total  system  weight 
for  the  six  facility-specific  simulations  (as  calculated  above  in  Step  3).  This 
standardized  weight  was  computed  so  that  the  mean  value  for  the  facilities  would 
be  1,  with  a  value  greater  than  1  indicating  higher  than  average  resource  usage 
and  a  value  lower  than  1  indicating  lower  than  average  resource  usage  (e.g.,  if  we 
continue  with  the  example  presented  above  in  Step  3,  the  standardized  weight 
case-mix  index  for  the  AWU  classification  system  for  simulation  B  would  be 
equal  to  0.023  divided  by  0.025;  this  equals  0.92). 


Step  5-- For  each  of  the  special  scenario  simulations,  we  repeated  Step  4.  The 
reason  that  we  used  the  same  denominator  for  both  the  facility-specific  and 
special  scenario  simulations  was  so  that  the  standardized  weights  for  all  12 
simulations  would  be  on  a  comparable  scale  (e.g.,  continuing  our  example,  if  we 
had  a  special  scenario  simulation  G  with  a  mean  total  system  weight  of  0.100,  then 
the  standardized  weight  would  be  0.100  divided  by  0.025;  this  equals  4.0). 
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4. 


LIMITATIONS  OF  THE  SIMULATION 


The  limitations  of  this  simulation,  for  the  most  part,  stem  back  to  the  three  source  databases  (i.e., 
ACDB,  CHAMPUS,  and  USTF)  and  have  been  documented  in  past  reports.  Many  of  these 
limitations  have  been  addressed  and  corrected  by  the  methodology  presented  in  sections  one  and 
two  of  this  chapter.  The  identified  limitations  include  the  following: 

•  No  Charge  Information  In  The  ACDB  And  USTF--The  CHAMPUS  database 
contains  the  charge  for  each  procedure  performed.  By  calculating  the  mean  cost 
for  each  CPT-4  code  in  CHAMPUS,  we  were  able  to  "cost"  the  ACDB  and  USTF 
by  mapping  the  charges  based  on  CPT-4  codes. 

•  Lack  Of  Pregnancy  Visits  In  The  Databases--We  addressed  this  issue  by 
performing  the  three-step  process  described  above  in  Section  1  to  create  the 
Women  of  Childbearing  Age  simulation.  However,  the  rest  of  the  simulations  do 
not  adequately  reflect  pregnancies  due  to  their  small  presence  in  the  three  source 
databases. 

•  No  UCAs  In  CHAMPUS  And  The  USTF--Of  our  three  databases,  only  the  ACDB 
had  UCA  codes  recorded  for  each  visit.  To  correct  for  this  limitation,  the 
methodology  described  above  in  Step  2  of  Section  2  was  performed. 

•  Not  All  UCA  Codes  Had  AWUs  Developed  For  Them--The  UCAs  for  ancillary 
procedures  and  flight  medicine  never  had  AWUs  developed  for  them.  We 
developed  them  by  performing  Step  3  of  Section  2  as  described  above. 

•  RBRVS  Does  Not  Cover  All  Of  The  CPT-4  Codes  Yet  And  Does  Not  Cover 
Special  CHAMPUS,  ACDB,  And  USTF  Codes— The  developers  of  the  RBRVS 
have  weighted  the  majority  of  CPT-4  codes  but  have  not  yet  completed  all  of 
them.  Also,  our  three  databases  each  contain  their  own  unique  codes  that  have 
not  been  weighted  for  the  RBRVS.  Any  procedure  codes  that  did  not  have  an 
RBRVS  weight  were  simply  left  unweighted  for  lack  of  a  better  solution  at  this 
time. 

•  Not  All  Visits  Had  All  Resource  Measures  From  All  Of  The  Case-Mix  Systems 
Attached-Several  visits  were  without  resource  measures  for  all  six  of  the  case- 
mix  classification  schemes  for  various  reasons.  This  was  corrected  by  performing 
Step  4  of  Section  2. 

As  suggested  in  our  discussion,  we  were  able  to  overcome  almost  all  of  the  data  limitations  that 
we  encountered.  The  results  of  the  simulation  also  showed  excellent  face  validity  and  should 
prove  to  be  useful  to  the  DoD  in  choosing  a  case- mix  classification  system.  Although  the  data 
currently  available  proved  to  be  adequate  for  performing  this  simulation,  the  availability  of  more 
complete  and  current  data  could  only  strengthen  the  total  effort. 


Birch  &  Davis  Associates.  Inc. 


Page  11-7 


CHAPTER  III 
RESULTS 


The  12  simulations  can  be  divided  into  two  categories:  (1)  facility-specific  simulations  and  (2) 
special  scenario  simulations.  Each  of  these  categories  will  be  covered  separately.  The  results  of 
the  simulation  will  be  presented  in  terms  of  the  properties  of  each  simulation  and  the  effects  of 
applying  the  different  case-mix  classification  schemes  to  the  simulations 

1.  CHARACTERISTICS  OF  EACH  SIMULATION 

Before  we  could  undertake  any  analyses  regarding  the  case-mix  classification  schemes,  a 
thorough  assessment  of  the  validity  of  each  simulation  was  performed  in  terms  of  selected 
population  characteristics.  Simulated  population  characteristics  can  be  found  in  Appendix  D. 
Highlights  of  this  assessment  are  presented  below. 

1.1  Facility-Specific  Simulations 

All  of  the  facility  simulations  are  numbered  to  facilitate  the  presentation  of  results,  as  follows: 

•  Simulation  l--Community  Hospital 

•  Simulation  2--Medical  Center 

•  Simulation  3--Medical  Center 

•  Simulation  4--Community  Hospital 

•  Simulation  5--Community  Hospital 

•  Simulation  6--Community  Hospital 

The  demographics  of  these  simulations  were  tailored  to  reflect  the  service  populations  of  actual 
MTFs  and,  therefore,  show  good  face  validity.  Also,  since  the  populations  are  not  radically 
different,  it  would  be  expected  that  the  top  10  diagnoses  for  each  of  the  facility-specific 
simulations  should  be  very  similar.  This  is  indeed  the  case.  For  example,  all  of  these  simulations 
contain  neurotic  depression;  adjustment  reaction,  culture  shock;  unspecified  otitis  media;  allergic 
rhinitis,  cause  unspecified;  and  routine  infant  or  child  health  check  in  their  top  10  diagnoses. 
In  addition,  all  but  one  contain  allergic  rhinitis  and  essential  hypertension  in  their  top  10 
diagnoses.  Also,  the  proportional  distribution  of  these  diagnoses  contri,'’jte  to  the  face  validity 
of  these  simulations.  For  example,  the  two  facility-specific  simulations  with  more  than  500 
children,  simulations  1  and  5,  show  routine  infant  or  child  health  check  as  accounting  for  more 
than  3  percent  of  their  diagnoses.  The  other  four  facility-specific  simulations  show  this  same 
diagnosis  as  accounting  for  no  more  than  2.5  percent  of  their  diagnoses.  Similarly,  the  two 
simulations  with  over  50C  children  each  show  more  than  16.5  percent  of  their  visits  as  being  to 
pediatric  clinics.  On  the  other  hand,  the  other  four  simulations  show  no  more  than  14.6  percent 
of  their  visits  to  pediatric  clinics. 

1.2  Special  Scenario  Simulations 

The  simulation  numbers  and  the  special  scenarios  that  they  represent  are  as  follows: 

•  Simulation  7--Children 
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•  Simulation  8--Active  Duty 

•  Simulation  9--Women  of  Childbearing  Age 

•  Simulation  10--Adults 

•  Simulation  ll--Over  65  Years  of  Age 

•  Simulation  12--Complex  Case-Mix 

These  simulations  yielded  case-mix  patterns  that  were  commensurate  with  their  age  distributions. 

Examples  of  this  include  the  following: 

•  The  simulation  of  children  under  the  age  of  15  years  contained  routine  infant  or 
child  health  checks  and  emotional  disturbances  of  childhood  in  the  top  10 
diagnoses. 

•  The  simulation  of  active  duty  had,  as  expected,  sprains  and  strains  and  routine 
medical  examination  in  the  top  10  diagnoses. 

•  The  simulation  of  women  of  childbearing  age  had  supervision  of  normal  first 
pregnancy  as  the  most  frequently  occurring  diagnosis.  In  addition,  almost  20 
percent  of  the  visits  in  this  simulation  were  to  obstetrics  and  gynecological  care 
clinics. 

•  The  simulations  of  adults  and  of  people  over  65  years  of  age  showed  higher 
representation  by  chronic  healt!  conditions.  For  example,  hypertension  and 
diabetes  mellitus  were  two  of  the  most  frequently  seen  diagnoses  in  these 
simulations. 

•  The  two  clinics  most  frequently  visited  in  the  complex  case-mix  simulation  were 
medical  and  surgical  clinics.  This  is  as  expected,  because  of  the  nature  of  the 
most  frequently  occurring  procedures  in  this  simulation.  For  example, 
extracapsular  cataract  removal  with  lens  insertion  and  surgical  laparoscopy  were 
the  most  frequently  performed  procedures. 

2.  RESULTS  OF  ANALYSES  OF  THE  CASE- MIX  CLASSIFICATION  SCHEMES 

Analyses  of  the  results  of  applying  the  six  case-mix  classification  schemes  to  each  of  the 

simulations  provided  some  insights  into  the  characteristics  of  each  allocation  strategy. 

2.1  Facility-Specific  Simulations 

The  results  of  our  analysis  of  the  case-mix  classification  schemes  as  applied  to  the  facility- 

specific  simulations  are  shown  in  Exhibits  III- 1 ,  III-2,  and  III- 3. 

Exhibit  III- 1  shows  the  resource  intensity  associated  with  each  allocation  methodology  when 

applied  to  the  facility-specific  simulations.  This  exhibit  suggests  the  following: 

•  There  was  a  relatively  high  degree  of  homogeneity  between  allocation 
methodologies  in  terms  of  resource  intensity.  However,  the  effect  of  choosing  one 
alternative  over  another  could  change  the  level  of  a  resource  budget  by  plus  or 
minus  8  percent. 
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DIFFERENCE  BETWEEN  AWUs  AND  RBRVS-BASED  GROUPERS 
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•  AWUs  were  the  least  sensitive  to  facility-specific  differences  in  case-mix. 

•  Measures  based  on  CHAMPUS  billed  charges  are  most  sensitive  to  facility- 
specific  differences  in  case-mix,  as  one  should  expect,  because  of  the  overall 
greater  dispersion  of  raw  CHAMPUS  dollars. 

•  RBRVS-based  measures  show  a  constrained  variation  more  comparable  to  AWUs, 
because  the  raw  RBRVS  weights  do  not  vary  as  much  as  raw  CHAMPUS  dollars. 

Exhibits  III-2  and  III- 3  show  the  variation  of  AVGs  and  APGs  when  compared  to  AWUs,  the 
case-mix  classification  methodology  currently  used  by  the  DoD.  Exhibit  III-2  shows  AVGs  and 
APGs  based  on  CHAMPUS  dollars  and  Exhibit  III-3  shows  AVGs  and  APGs  based  on  the 
RBRVS.  Results  that  were  drawn  from  Exhibits  111-2  and  III-3  when  examined  individually 
include  the  following: 

•  RBRVS-based  APGs  and  AVGs  varv  by  plus  or  minus  2  percent  (see  Exhibit  III- 
3). 

•  CHAMPUS-based  APGs  and  AVGs  are  somewhat  more  variable  than  RBRVS- 
based  weights,  ranging  from  6  percent  to  minus  2  percent. 

•  For  three  simulations,  the  two  groupers  reacted  in  opposite  directions  from  the 
AWU  baseline;  in  three  other  cases,  they  moved  in  similar  directions.  No 
consistent  pattern,  or  underlying  causes,  were  identified  (see  Exhibit  III-3). 


•  Simulation  5  exhibited  a  large  difference  between  AVGs  and  APGs,  possibly  due 
to  a  high  number  of  psychotherapy  visits  (see  Exhibit  II1-3). 

Simultaneous  consideration  of  the  data  presented  in  Exhibits  1II-2  and  III-3  suggests  the 
following: 

•  For  simulation  1,  CHAMPUS-based  case-mix  methods  showed  lower  resource 
requirements  than  RBRVS-based  methods.  This  may  be  due  to  the  higher 
representation  of  children  and  the  associated  lower  chronic  morbidity  of  this 
population. 

•  For  simulation  5,  AVGs  were  much  more  generous  than  APG-based  case-mix 
methods. 


2.2  Special  Scenario  Simulations 

The  results  of  applying  the  case-mix  classification  schemes  to  the  special  scenario  simulations 
are  shown  in  Exhibits  III-4,  III-5,  III-6,  and  III- 7. 

Exhibit  III-4  shows  the  resource  intensity  for  each  allocation  methodology  when  applied  to  the 
special  scenario  simulations.  We  derived  the  following  insights  from  this  exhibit: 

•  As  with  the  facility  simulations,  the  following  results  were  once  again  evident. 

AWUs  were  the  least  sensitive  to  facility-specific  differences  in  case-mix. 
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RESOURCE  INTENSITY  BY  METHOD 
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DIFFERENCE  BETWEEN  AWUs  AND  GROUPERS 


AVG-C  AVG-R  APG-C  APG-R 


Measures  based  on  CHAMPUS  billed  charges  are  most  sensitive  to 
differences  in  case-mix. 


RBRVS-based  measures  show  a  constrained  variation  more  comparable  to 
AWUs. 

•  There  was  high  face  validity  in  terms  of  expected  behavior  of  the  case-mix 
methods. 

•  The  Over  65  Years  of  Age  simulation  is  the  highest  in  terms  of  resource  intensity 
regardless  of  the  case-mix  system  employed. 

•  Alternatively,  the  simulation  of  children  had  the  lowest  indices  of  resource 
intensity. 

Exhibits  III-5  and  I1I-6  show  the  variation  of  AVGs  and  APGs  when  compared  to  AWUs. 
Exhibit  III- 5  shows  AVGs  and  APGs  based  on  CHAMPUS  dollars,  and  Exhibit  III-6  shows 
AVGs  and  APGs  based  on  the  RBRVS.  Results  drawn  from  examining  these  exhibits  include 
the  following: 

•  The  difference  between  AWUs  and  the  groupers  was  sizable,  ranging  from  plus 
9  percent  to  minus  1 1  percent  based  upon  RBRVS  and  up  to  plus  30  percent  based 
upon  CHAMPUS  dollars. 

•  RBRVS-based  AVGs  appear  to  be  somewhat  more  favorable  to  children  than 
other  methods. 

•  Neither  APGs  nor  AVGs  offered  any  resourcing  advantages  to  active-duty 
personnel.  This  may  be  because  AWU  weights  are  based  upon  all  population 
segments  in  each  clinic.  Since  active-duty  personnel  generally  represent  the 
healthier  segment  of  a  population,  their  visits  usually  require  less  resources  than 
the  clinic  average. 

•  Only  RBRVS-based  APGs  were  not  advantageous  to  women  of  childbearing  age. 

•  For  the  simulation  of  adults,  RBRVS-based  groupers  indicated  lower  resource 
requirements  than  either  AWUs  or  CHAMPUS-based  groupers. 

•  For  the  Over  65  Years  of  Age  simulation,  APGs  reward  better  than  AWUs  or 
AVGs.  For  this  older  population,  APGs  reflect  the  higher  level  of  procedure- 
oriented  visits. 

Exhibit  III- 7  shows  the  resource  intensity  for  each  allocation  methodology  when  applied  to  the 
complex  case-mix  simulation.  Results  that  were  drawn  from  this  exhibit  include  the 
following: 

•  AWUs  were  the  least  sensitive  to  this  simulation. 
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•  RBRVS-based  methods  were  less  sensitive  to  the  complex  case-mix  visits  than  the 
CHAMPUS  billed  charge-based  methods,  once  again  due  to  the  more  constrained 
variation  of  the  raw  RBRVS  weights.  However,  both  RBRVS  and  CHAMPUS- 
based  methods  showed  much  higher  resource  levels  in  comparison  to  AWUs. 

•  The  simulation  of  complex  case-mix  visits  yielded  the  highest  resource  values  of 
all  the  simulations,  regardless  of  the  case-mix  system. 
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CHAPTER  IV 


CONCLUSIONS  AND  RECOMMENDATIONS 


CHAPTER  IV 

CONCLUSIONS  AND  RECOMMENDATIONS 


The  results  of  our  analysis  do  not  conclusively  point  to  any  of  the  case-mix  classification 
methodologies  as  being  clearly  superior.  Each  of  the  systems  has  its  merits  and  its  drawbacks, 
although  none  of  the  systems  has  a  weakness  that  is  serious  enough  to  completely  exclude  it  from 
consideration.  In  Exhibit  IV- 1,  we  have  listed  our  suggested  criteria  for  evaluating  the  case-mix 
systems  along  with  a  brief  evaluation. 

For  each  of  the  criteria  listed  in  the  table,  we  present  a  brief  description  of  our  rationale  in 
giving  the  alternative  case-mix  classification  systems  the  ratings  shown  in  the  table. 

•  Coherent  Categories  For  Clinician  Use--AVGs  and  APGs  were  designed  by 
physicians  for  physicians  with  the  intent  of  grouping  visits  on  a  clinical  basis. 
AWUs,  on  the  other  hand,  were  based  on  MEPRS  data  relevant  to  clinics.  Each 
clinic  type  aggregates  a  wide  variety  of  visits  and  not  a  coherent  set  of  visits. 

•  Homogeneity  Of  Categories-- A VGs  are  based  mainly  on  the  ICD-9  diagnosis 
code  while  APGs  rely  very  heavily  on  CPT-4  procedure  codes.  Each  AVG  and 
APG  shows  a  great  deal  of  homogeneity  based  on  these  codes.  AWUs,  on  the 
other  hand,  are  grouped  by  the  type  of  clinic  facility,  which  means  that  a  variety 
of  visits  could  fall  into  the  same  group.  For  example,  a  broken  leg  and  a 
pregnancy  visit  could  occur  in  the  same  group.  This  will  not  happen  with  AVGs 
or  APGs. 

•  Components  Of  Unit  Weights  Are  Known--An  APG  visit  weight  is  formed  by 
combining  the  weights  assigned  to  each  procedure  within  a  visit.  AVG  and  AWU 
visits  are  simply  assigned  to  a  group  and  given  that  group’s  weight.  Nothing  is 
known  about  the  components  of  the  visit  using  these  two  systems. 

•  HCFA  Will  Probably  Choose  This  Method--HCFA  is  currently  planning  to 
implement  APGs  for  Medicare  patients. 

•  Widely  Used--Since  APGs  are  going  to  be  used  by  HCFA,  they  will  most  likely 
be  adopted  industrywide.  AWUs  are  used  only  by  the  DoD,  and  AVGs  are  not 
currently  in  use. 

•  Easy  To  Use--AWUs  are  the  easiest  of  the  three  systems  to  use,  because  they  are 
driven  solely  by  the  number  of  patients  seen  in  a  clinic.  AVGs  are  also  easy 
because  a  visit  diagnosis  is  almost  all  that  is  needed  for  this  grouper  to  work. 
APGs,  on  the  other  hand,  are  the  hardest  to  use,  because  a  more  extensive  set  of 
input  data  is  needed.  However,  if  the  data  are  effectively  captured,  all  of  these 
systems  will  be  fairly  simple  to  implement. 

•  Encourage  Primary  Care/Prevention — AWUs  tend  to  reward  all  clinics  similarly; 
therefore,  they  encourage  low-intensity  visits  because  they  will  be  given  credit 
for  more  resources  than  they  actually  need. 
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EXHIBIT  IV-1 


SUGGESTED  CRITERIA  FOR  EVALUATING  CASE-MIX 
CLASSIFICATION  METHODS 


•  Reflects  Case-Mix--APGs  and  AVGs  naturally  reflect  case-mix  based  on  their 
comprehensive  set  of  groupings.  AWUs  do  not  reflect  case-mix  because  of  the 
wide  variety  of  visits  that  can  occur  in  a  given  clinic. 

•  Facilitates  Make-or-Buy  Decisions--Of  the  three  systems,  APGs  represent  the 
only  one  that  facilitates  these  decisions  because  each  visit  gets  weighted  at  the 
procedure  level.  This  will  allow  for  make-or-buy  comparisons  for  each 
component  of  a  visit.  AVGs  and  AWUs  place  a  weight  on  the  total  visit  and  do 
not  break  the  visits  down  into  pieces. 

•  Statistical  Process  Control--AVGs  were  originally  designed  to  be  used  for 
statistical  process  control,  not  for  cost  accounting.  AVGs  offer  statistical  process 
control  because  each  visit  will  have  a  total  resource  consumption  level  that  can  be 
compared  with  the  weight  for  the  group  under  which  the  visit  falls.  APGs  also 
offer  statistical  process  control--but  at  the  procedure  level  as  opposed  to  the  total 
visit  level.  With  AWUs  the  groups  are  not  clinically  meaningful,  which  means 
that  any  comparisons  between  visits  in  a  group  are  practically  meaningless. 

•  Utilization  Management- -Since  APGs  weight  each  procedure  that  is  performed, 
utilization  management  is  facilitated.  AVGs  also  facilitate  it,  but  to  a  lesser 
extent,  because  each  visit  is  weighted,  not  each  procedure. 

For  measuring  resources  using  AVGs  or  APGs,  we  feel  that  the  RBRVS  is  better  than  CHAMPUS 
dollars  because  it: 

•  Represents  a  more  comprehensive  approach  to  costing,  considering  three 
dimensions  of  physicians’  charges:  (1)  physician  time,  (2)  technical  (office  staff, 
rent,  and  supplies),  and  (3)  malpractice  insurance. 

•  Eliminate  distortions  in  current  RVs. 

•  Mitigate  differences  by  specialty. 

•  Encourage  "cognitive"  medicine. 

Given  that  none  of  the  specific  systems  has  deficiencies  that  would  make  them  unusable,  the 
selection  of  one  method  should  reflect  the  DoD’s  desired  incentives.  Although  we  have  created 
the  table  (Exhibit  IV- 1)  of  our  suggested  criteria  for  evaluating  the  case-mix  systems,  we  have 
not  determined  the  relative  importance  of  each  item.  The  significance  of  each  criterion  depends 
on  the  DoD’s  desired  incentives.  For  example,  if  the  items  under  the  managed  care  heading  in 
Exhibit  IV- 1  are  deemed  to  be  the  most  important,  then  APGs  would  clearly  be  a  better  choice 
than  AWUs  or  AVGs.  However,  if  ease  of  use  is  deemed  to  carry  the  most  weight,  then  AWUs 
would  be  the  obvious  choice. 

Based  on  our  analysis,  we  feel  that  APGs  are  the  best  case-mix  classification  system  for  use  by 
the  DoD.  We  based  this  decision  on  the  following  factors: 

•  Since  HCFA  will  be  adopting  APGs,  we  feel  that  the  DoD  should  also  adopt  them 
in  an  effort  to  maintain  consistency.  CHAMPUS  providers  will  be  using  APGs 
with  their  Medicare  patients  as  a  result  of  the  HCFA  move  and  will  not  want  to 
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have  to  use  two  different  systems.  Also,  if  the  DoD  wants  to  be  able  to  compare 
their  data  with  the  rest  of  the  country  in  future  studies,  APGs  will  allow  for  this 
comparability. 

•  Based  on  the  full  list  of  criteria  in  the  matrix,  APGs  come  out  on  top  of  both 
AWUs  and  AVGs. 

•  APGs  are  essentially  the  next  generation  of  AVGs,  with  more  flexibility  due  to 
service  bundling. 

•  AVGs  are  now  static,  whereas  APGs  continue  to  be  studied  and  refined. 
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PROCEDURES  AND  DIAGNOSIS  USED  FOR  PREGNANCY  VISITS 


Initial  Visit 

Diagnosis:  V22.0:  Supervision  of  normal  pregnancy 

Procedures:  90020:  Office  and  other  outpatient  medical  service;  comprehensive  service 

84702:  Gonadotropin,  chorionic;  quantitative 

85021:  Blood  count;  hemogram,  automated  (RBC,  WBC,  Hgb,  Hct,  and 
indices  only) 

86592:  Syphilis  test;  qualitative  (e.g.,  VDRL,  RPR,  ART) 

88150:  Cytopathology,  smears,  cervical  or  vaginal  (e.g.,  Papanicolaou),  up 
to  three  smears;  screening  by  technician  under  physician 
supervision 

87110:  Culture,  chlamydia 

86083:  Blood  typing;  ABO,  Rh(D),  and  RBC  antibody  screening 
81000:  Urinalysis  (pH,  specific  gravity,  protein,  tests  for  reducing 
substances  such  as  glucose);  with  microscopy 
87205:  Smear,  primary  source,  with  interpretation;  routine  stain  for 
bacteria,  fungi,  or  cell  types 


Follow-up  Visit 

Diagnosis:  V22.0:  Supervision  of  normal  pregnancy 

Procedure:  59420:  Antepartum  care 
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METHODOLOGY  FOR  COMPUTING  THE  TOTAL 
APG  WEIGHT  PER  VISIT 


APPENDIX  C 

METHODOLOGY  FOR  COMPUTING  THE  TOTAL 
APG  WEIGHT  PER  VISIT 


The  following  methodology  is  based  on  the  3M/HCFA  APG  report.  The  weights  that  were 

attached  to  each  APG  were  developed  using  the  full  CHAMPUS  encounter  database. 

Test  for  the  type  of  visit: 

•  Significant  Procedure  Visit  (PATTYPE  =  1) 

•  Ancillary  Visit  (PATTYPE  =  2) 

0  Medical  Visit  (PATTYPE  =  3) 

If  it  is  a  Significant  Procedure  Visit,  then  do  the  following: 

•  Search  for  all  significant  procedure  APGs  (PAPGTYPE  *  1)  that  are  not 
consolidated  (CONS  =  0)  and  have  a  relative  weight  attached. 

•  Once  all  of  these  APGs  have  been  located,  order  them  from  highest  relative 
weight  to  the  lowest. 

•  Total  significant  procedure  APG  weight  *  (significant  procedure  with  the  highest 
relative  weight)  +  (0.4  x  the  significant  procedure  with  the  second-highest  relative 
weight)  +■  (0.2  x  the  sum  of  the  relative  weights  of  all  remaining  significant 
procedure  APGs). 

•  Search  for  all  ancillary  procedure  APGs  (PAPGTYPE  =  2)  that  are  not  packaged 
(ANCPK  =  0)  and  have  a  relative  weight  attached. 

•  Determine  if  these  APGs  are  lab  test  APGs  or  nonlab  test  APGs.  A  lab  test  is 
defined  as  APGs  417  to  440  and  a  nonlab  test  is  defined  as  APGs  341  to  392  or 
APGs  443  to  461 . 

•  Total  ancillary  procedure  APG  weight  *  the  sum  of  all  ancillary  procedure  APG 
weights,  with  the  conditions  that  if  any  lab  test  occurs  more  than  once,  the 
subsequent  occurrences  are  multiplied  by  0.8  and  if  any  nonlab  test  occurs  more 
than  once,  the  second  occurrence  is  multiplied  by  0.4  and  any  occurrence 
thereafter  is  multiplied  by  0.2. 

•  Total  APG  visit  weight  =  total  significant  procedure  APG  weight  +  total  ancillary 
procedure  APG  weight. 

If  it  is  an  Ancillary  Visit,  then  do  the  following: 

•  Search  for  all  ancillary  procedure  APGs  (PAPGTYPE  ■  2)  that  are  not  packaged 
(ANCPK  ■  0)  and  have  a  relative  weight  attached. 
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•  Determine  if  these  APGs  are  lab  test  APGs  or  nonlab  test  APGs.  A  lab  test  is 
defined  as  APGs  417  to  440  and  a  nonlab  test  is  defined  as  APGs  341  to  392  or 
APGs  443  to  461. 

•  Total  ancillary  procedure  APG  weight  -  the  sum  of  all  ancillary  procedure  APG 
weights,  with  the  conditions  that  if  any  lab  test  occurs  more  than  once  the 
subsequent  occurrences  are  multiplied  by  0.8  and  if  any  nonlab  test  occurs  more 
than  once,  the  second  occurrence  is  multiplied  by  0.4  and  any  occurrence 
thereafter  is  multiplied  by  0.2. 

•  Total  APG  visit  weight  =  total  ancillary  procedure  APG  weight. 

If  it  is  a  Medical  Visit,  then  do  the  following: 

•  If  the  assigned  medical  APG  has  a  relative  weight,  then  store  it  as  the  total 
medical  APG  weight. 

•  Search  for  all  ancillary  procedure  APGs  (PAPGTYPE  =  2)  that  are  not  packaged 
(ANCPK  =  0)  and  have  a  relative  weight  attached. 

•  Determine  if  these  APGs  are  lab  test  APGs  or  nonlab  test  APGs.  A  lab  test  is 
defined  as  APGs  417  to  440  and  a  nonlab  test  is  defined  as  APGs  341  to  392  or 
APGs  443  to  461. 

•  Total  ancillary  procedure  APG  weight  -  the  sum  of  all  ancillary  procedure  APG 
weights,  with  the  conditions  that  if  any  lab  test  occurs  more  than  once  the 
subsequent  occurrences  are  multiplied  by  0.8  and  if  any  nonlab  test  occurs  more 
than  once,  the  second  occurrence  is  multiplied  by  0.4  and  any  occurrence 
thereafter  is  multiplied  by  0.2. 

•  Total  APG  visit  weight  =  total  medical  APG  weight  +  total  ancillary  procedure 
APG  weight. 


i 
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